PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PDK_30s686701g002
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Arecales; Arecaceae; Coryphoideae; Phoeniceae; Phoenix
Family MYB
Protein Properties Length: 1610aa    MW: 174045 Da    PI: 5.7066
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PDK_30s686701g002genomePDKView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding21.84.4e-07690731346
                        SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
    Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                        +WT+ E e + +  + +G++ +++I++ ++ ++t  +c+++++k
  PDK_30s686701g002 690 PWTQGEKEVFMEMLATFGKD-FTKISSFLN-HKTTADCIEFYYK 731
                        8*****************99.*********.***********98 PP

2Myb_DNA-binding33.97.5e-11897938245
                        SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
    Myb_DNA-binding   2 grWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45 
                        g WT eE   +++a  ++G++ ++ I+r++g +R+++qck ++ 
  PDK_30s686701g002 897 GDWTDEEKSMFIRALSMYGKD-FAMISRCVG-TRSREQCKIFFS 938
                        78*****************99.*********.********8775 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466898.22E-14674736IPR009057Homeodomain-like
PROSITE profilePS5129314.371686737IPR017884SANT domain
SMARTSM007172.3E-7687735IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.608.3E-5689734IPR009057Homeodomain-like
PfamPF002491.9E-5690731IPR001005SANT/Myb domain
PROSITE profilePS5129313.784894945IPR017884SANT domain
Gene3DG3DSA:1.10.10.601.1E-6894939IPR009057Homeodomain-like
SMARTSM007172.8E-10895943IPR001005SANT/Myb domain
PfamPF002491.1E-8897937IPR001005SANT/Myb domain
SuperFamilySSF466891.09E-11897946IPR009057Homeodomain-like
CDDcd001674.48E-8899937No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1610 aa     Download sequence    Send to blast
MPFVSGVFDA SGTGQYRQGG GYHQLYPENP GAHGCTPSRS DRFWLEDEGF RPSSGRYGGG  60
GRSSSGGSRE SRGSFRRSPY WDSGDFSRQQ HHDPPVTAQR SVAVPISPAS QPPLKDQNDK  120
TGGAVDDGSG TGHRFDRDHS LGAMSWKPLK WSRAGSLSST KTGRSESEET GLEVLVPTGK  180
ETPIRSPVTS PVPSDEGASK KKPRLGWGQG LAKYEKQKVE GSLDVSGTAA KGALNETSPK  240
VVGLAGCPSP ATPGSVTCSS SPAGIEEKPC SKVVNGDNDT SHYGVSPEEF SNKLGHMEGN  300
PINMLTTLLA DLWQPDDAFA GDCTFSRQTA MNKLLLLKED ISKELEKTEW EIDLFENELK  360
SLNTDPENDP RQSSVTSPAN IAPELCIASS NVASKDSNPS KDHEFTSSAV TLVENDALPT  420
IALNEHDAEL KGVDVDSLQA VLSRFNNSAS SRKGVCDHET EKLAECSKIV ENDRFKVPEI  480
QHFVLSDDVE RTATVCDLGD GSRGEAGSSN DNGNSEASLH GKTDCNLITL IMASNRDAAK  540
KASQGAPPFV EGGSAIAFYK KTSYKVKWTF AFHMFIANEL MVVIACERSE NISAQCLPYF  600
WTLSCHRVYL CGLEAGNLTA GNLTLVPTTE IVEFTSKLLS DSQIKLYRNN LKMPSLILDE  660
KERKQTKFKT HNGLIEDPNS FEKERAMINP WTQGEKEVFM EMLATFGKDF TKISSFLNHK  720
TTADCIEFYY KNHKSESFRE VKKRLNLKKQ WQRLPTSSYL GTSGKKWNRE VNAASLDMLG  780
AASVVAAHSN GNVMSQQRYA GHGAHHGLKV SCGSYGSLDK VRCVEIPGHE RETVAADVLA  840
GICGALEAMS SCVTSAVDPV EKMNYTAKER PLTPEVTQNF DEDDTCSDEG CGELDSGDWT  900
DEEKSMFIRA LSMYGKDFAM ISRCVGTRSR EQCKIFFSKA RKCLGLDVIY QGTGNGGMPM  960
NDTNGGRSDT DDAYAAEMDS AICSTQSCSK MDTDVSQSVA NISSEGFVHA ASTPLQAETD  1020
KSSEQDVVGG INLEEDEGKV DKQASVLHDN KLASEVGNPQ AMQDADAALR CNASVQHEAV  1080
VSVDAEMKME GRSPIVSPVE PFLMVCMEVE SKSHVDDVVE QKDTGGSADV SKKEVDVSLL  1140
VPETGSRNRQ QSVDLGATNS GTICSVSDSE ADANALHPGS KDDVCPRSTF APIYHHQIQL  1200
DLLPCLQNKP QGFSLKQENP HSVPLNSLLP DPSSACFEGP RLVASQATSN FEEQGNKRHQ  1260
NPVARELYQV DQPLHMMRNP SLNQVDQPLH ILRGYPLQVL NPVEKEADPL IGENAVFMES  1320
HPKRNGVSQS NQFFTSEMYG DHCNGSNLSH LTPGVLFPPR NEAQPEAQLK HCSQNSCSEP  1380
EEQAHPTGDV KLFGKIICHP SSSQKSNSSS HDCNSKPSSP KMNRSSNLKS SNGGRAGALF  1440
ASRPGSSGHG GLGELPLRSY GFWDGNRIQA GFSSLPDSAV MLAKYQGSLA GMSFYSAKES  1500
VPSRNRILTD YQQSYMQHLS SDEKRLQSFC ELQKRNGIET VSGFQQQGRV ARLGSNMVGG  1560
GILGSGSGGG GGSGGGGVSD PVAALKMHYA ARAKVLSGEL ESWRGDIGGR
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_D3e-14653739994NUCLEAR RECEPTOR COREPRESSOR 2
4a69_C3e-14653739994NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_008789086.10.0PREDICTED: uncharacterized protein LOC103706673 isoform X1
TrEMBLM0TC910.0M0TC91_MUSAM; Uncharacterized protein
STRINGGSMUA_Achr6P32970_0010.0(Musa acuminata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP62993549
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.15e-85MYB family protein